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Abstract 

We prove that a balanced Boolean function on Sn whose Fourier transform is highly concentrated 
on the first two irreducible representations of Sn, is close in structure to a dictatorship, a function 
which is determined by the image or pre-image of a single element. As a corollary, we obtain a 
stability result concerning extremal isoperimetric sets in the Cayley graph on Sn generated by the 
transpositions. 

Our proof works in the case where the expectation of the function is bounded away from and 
1. In contrast, 6, deals with Boolean functions of expectation 0(l/n) whose Fourier transform is 
highly concentrated on the first two irreducible representations of Sn- These need not be close to 
dictatorships; rather, they must be close to a union of a constant number of cosets of point-stabilizers. 

1 Introduction 
1.1 Background 

This paper (together with [5] and [7]) is part of a trilogy dealing with stability and quasi-stability results 
concerning Boolean functions on the symmetric group, which are of 'low complexity'. 

Let us begin with some notation and definitions which will enable us to present the Fourier-theoretic 
context of our results. Following this, the paper will be essentially Fourier-free, since Lemma [2] translates 
the relevant Fourier notion into a combinatorial one. 

Let Tij = {a Cz Sn '■ <j{i) — j}, henceforth a 1-coset. Similarly, for t > 1, and for two ordered i-tuples 
of distinct elements, / = (ii, . . . , it) and J — (ji, . . . , jt), we let Tjj — {a G Sn ■ cr{I) — J}, henceforth 
a t-coset. Abusing notation, we will also use Tij and Tjj to denote their own characteristic functions. 

We say that a Boolean function /; 5„ — > {0, 1} is a dictatorship if it is a sum of T^ 's. Note that 
since the value of the function is either or 1, the summands indicate disjoint events, and therefore a 
dictatorship is determined by either the image or the pre-image of a single element. 

For i > 1, we say that a Boolean function is t- controlled if it is a sum of disjoint t-cosets. Unlike in 
the case t — 1, such a function need not depend upon the image or pre-image of just one i-tuple, e.g. 

7'(l,2)(l,2) +r(i,3)(4,5)- 

For any non-negative integer t, let Ut be the vector space of real- valued functions on Sn whose Fourier 
transform is supported on irreducible representations indexed by partitions of n, whose largest part has 
size at least n — t. (All partitions A for which A ^ {n — t, 1*), where ^ refers to the lexicographical order 
on partitions.) If / is a real-valued function on Sn, we define the degree of / to be the minimum t such 
that f £ Ut- This is a measure of the complexity of /, analogous to the degree of a Boolean function on 
{0, 1}". Indeed, it is precisely the minimum possible total degree of a polynomial in the Tij's which is 
equal to /. 

Note that Uq is the space of functions whose Fourier transform is supported on the trivial represen- 
tation — i.e., the space of constant functions. The space Ui, which is the main subject of this paper, is 
the space of functions whose Fourier transform is supported on the two irreducible constituents of the 
permutation representation. As promised, we now de-Fourierize this definition. 

First, an easy fact: 
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Lemma 1. For any t G N, the indicators oft-cosets (the functions Tij), are in Ut- 

Next, a slightly more intricate fact, observed and proved in [5]: 

Lemma 2. The Tjj 's span Ut. 

Therefore, in this paper, which deals with Ui, we will only use the definition Ui — SpanjTy}. 
Finally, we recall a theorem from 8 , which characterizes the Boolean functions in Ut- 

Theorem 1 (Ellis, Friedgut, Pilpel). Let / : 5,i — > {0, 1} be in Ut- Then / is t-controlled. 

Now, the goal of the current paper, together with [5] and [7], is to provide stability versions of this 
theorem. This is in the spirit of similar projects in the Abelian case, which have proved extremely useful 
and apphcable, see e.g. [3], [H], [ID], [12]; [IS] and [T3]. The general idea in apphcations is to prove 
results in extremal combinatorics using Fourier analysis, and then use the Fourier stability results in order 
to deduce combinatorial stability results. A good example of this is Theorem [Sj in this paper, where we 
characterize the almost-extremal sets for the edge-isoperimetric inequality in the transposition graph on 
Sn (the Cayley graph on Sn generated by the transpositions) . See [B] for more about applications in the 
symmetric group setting. 

The division between the three papers in our trilogy is as follows: in [6], we deal with Boolean 
functions which are close to C/i, and have expectation 0{l/n). We prove that such a functions must be 
close to a sum of dictatorships — equivalently, close to a union of 1-cosets. In the current paper, we 
prove that Boolean functions that are close to J7i, and whose expectation is bounded away from and 
1 , must be close to a single dictatorship. Finally, in [7] , we deal with Boolean functions close to Ut , with 
expectation 0(n~*); we prove that they must be close to a union of i-cosets. The term 'quasi-stability' 
in the titles of the other two papers refers to the fact that a Boolean function such as Tn + T22 — Tn • T22 
is 0(l/rt^) close to Ui, and is indeed 0(l/n^) close to Tn + T22, which is a sum of two dictatorships, 
but is not 0{l/n^) close to any single dictatorship. In the case studied in this paper, we have bona fide 
stability, as the functions in question turn out to be close to a dictator. For both ranges of expectation 
we have studied, however, the Boolean functions which are close to Ui are close to a union of 1-cosets. 
Interestingly, the proof in this paper is quite different from the proof in [6]. It would be interesting to 
find a common proof that covers the complete spectrum of possible values of IE[/]. 

The outline of the paper is as follows. In the rest of this section, we set up our notation, state our 
main result, and outline the proof. In section [21 we prove the main theorem for the case of functions 
with expectation = 1/2. Next, in section [3J we adapt the proof to deal with functions with expectation 
bounded away from and 1. Finally, in section |4j we give an application of our main theorem: a 
characterization of the almost-extremal sets for the edge-isoperimetric inequality for the transposition 
graph on Sn- 

1.2 Notation 

The characteristic vector of a set S is denoted by xs- If -B is a statement, the indicator Is is equal to 1 
if B is true and if i? is false. 

Let Sn denote the group of all permutations of [n] — {1, . . . ,n}. For each i,j e [n], we define 

T^j = {tt e S'n : Tr{i) = j} 

to be the set of all permutations sending i to j; we call these the 1-cosets of Sn, as they are the cosets of 
stabilisers of points. Abusing notation, we will also use Ty to denote its own characteristic vector, more 
properly written as XTij ■ 

We define Ui{n) to be the subspace of M.^" spanned by {Tij : i,j € [n]}. When n is understood, we 
abbreviate this to Ui. 

We equip M " with the inner product induced by the uniform probability iTLG^sure on Sn- 
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The expectation of a real-valued function on Sn will mean the expectation with respect to the uniform 
probability measure, i.e. 

We let II • II2 denote the induced Euclidean norm; i.e. 

ll/ll2 = yE[7^= /J^Tw^. 

The distance between functions, or between a function and a subspace, will mean the Euclidean distance 
as defined by this norm. 

The notation x ± e is shorthand for the closed interval [a; — e, x + e]. If y € x ± e, then we say that y 
is e-close to x. If y ^ x ± e, then we say that y is e-far from x. For a set S", we say that x is e-close to 
5 if |x — y| < e for some y G S. Otherwise, we say that x is e-far from S. 

We will be dealing throughout with functions on finite probability spaces (i.e., with random variables); 
we will frequently refer to these simply as 'functions' (rather than as 'random variables'), when the 
underlying probability space is understood. 

1.3 Main result 

Our main goal in this paper is to prove the following theorem. 

Theorem 2. Let T d Sn he a family of permutations with size |J^| = c • n\, satisfying 

m - hf] = ei, 

where f = 2%^ — 1, and fx is the orthogonal projection of f onto U\. Then there exists a family Q C Sn 
which is a union of dn disjoint 1-cosets, such that 

where r] = min{c, 1 — c} . 

Remark. It is convenient to work with the ±1 -valued function f, rather than the 0/1-valued function 
XT- Note that if {xt)i denotes the orthogonal projection ofxT onto Ui, then the square of the Euclidean 

distance of xf from Ui is 

Throughout the proof, a Boolean function will mean a function taking values in {±1}, rather than {0, 1}. 
For the entire proof, we will make the assumptions 

n>4, (1) 
^ < ei < eo, (2) 

where fo > depends only upon c. Later, we will show how to get rid of these assumptions. During 
the proof, we will use the phrase since ei is small enough, P holds to mean that for some eo > 0, the 
statement P follows from ei < cq. 

For pedagogical reasons, we will first assume; that c = 1/2. This assumption does not affect the proof 
very much, but it simplifies the expressions appearing therein. After completing the proof in this case, 
we will show how to extend it to general c, carefully noting the relation between eo and c. 
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1.4 Proof overview 



We adopt a simple canonical way to express /i as a linear combination of the form 



we then study the matrix of coefficients {dij). This offers a nice visualization of the function, due to the 
observation that 



i.e. /i(7r) is equal to the sum of the entries on a generalised diagonal of the matrix (ay ). (A generalised 
diagonal of an n x n matrix is a set of n entries with one entry from each row and one from each column, 
so corresponds to a permutation of {1, 2, . . . , n}.) 

Note that the Tij's are linearly dependent (the dimension of [/i is only {n — 1)^ + 1, whereas there 
are different Tij's), so there are many possible ways to represent /i in such a manner. It turns out 
that a particularly useful choice (when c — 1/2) is 

a,;,-(n-l)(/,T,,). [(3)] 

To illustrate this, here is the matrix corresponding to the dictatorship = {a e 5„ : 1 < cr(l) < n/2} 
(where n is even): f 



1- i 
i" 



i" " 1 



This matrix exemplifies the usefulness of our choice Uij = (n— 1)(/, Ty): the entries that are significant 
for our dictatorship (row 1, which depends on the image of 1) are all close in absolute value to 1, whereas 
all other entries are close to 0. The idea of the proof is to discover some properties of the matrix (a^ ), 
and then show that they imply that it looks roughly like the matrix above — namely, that it has precisely 
one row or column in which almost half the entries are very close to 1 and almost half the entries are 
very close to —1, and that almost all the other entries in the matrix are very close to 0. 

The proof breaks down into two main parts. In the first part, we show that for almost all tt G S'„, the 
generalised diagonal defined by tt, namely {ai7r(i) '■ 1 < * < f^}, has precisely one entry which is 'large' 
(close to 1 or —1), and all the rest of its entries are small. In the second part, we deduce that (aij) must 
have either a row or a column, almost all of whose entries are large. This will enable us to complete the 
proof. 

Part 1 



Step 1 ( § 2.2 | § 2.3 ). Consider any two sets X,Y C {1, . . . , n} with \X\ = |y |, and the corresponding 
set of permutations 

Tx,Y = {tt e Sn ■■ 7t{X) = Y}. 

When calculating /i on Tx,y, we only need to look at the submatrices of (a^ ) defined hy X xY and by 
X X Y. So it is natural to define, for tti a bijcction from X to Y and tt2 a bijection from X to Y, 

diM = ^a,^iii), 52(7^2) = ^ flj^^C*)' 5(7^1, 7r2) = 31 (vTi) +52(71-2), 



iex 



iex 



where (7ri,7r2) denotes the permutation of Sn whose restrictions to X and X are tti and 1^2 respectively. 
Notice that g is just the restriction of /i to Tx.y- For most choices of X,Y, these functions are 'well- 
behaved', meaning that all of the following hold ( |§ 2.2[ ): 



For most permutations n e Tx,y, g{T^) is close to ±1. 
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• Furthermore, the function g is close to ±1 in an sense. 

• Both Egi and are close to their expected value, 0. 

Next, we note that, crucially, the function g (which can be viewed as a random variable) is the sum of 
two independent random variables gi,g2, and yet is concentrated near 1 and —1. How can that happen? 
We show ( |§ 2.3[ ) that it must be the case that one of the gi's (say gi) is concentrated around a constant 
C, and that the other (say 32) is concentrated around two values, — C — 1 and — C + 1. Using the 
observations above, it follows that C is very close to 0. 

Step 2 ( |§ 2.4[ ). For any permutation tt G Sn, we consider all pairs {X,Y) compatible with it, i.e. 
all pairs {X, tt{X)). For most choices of tt and for most choices of compatible {X, Y), it will be true that 
one of 5i(7ri), (72(7r2) is close to 0, and the other is close to ±1. Note that 

5l('^l) = X! ""(i), 52(71'2) = ^ ai7r(i)- 

*6-y iex 

Put differently, for most permutations tt G 5„ it is true that for most ways of splitting the generalised 
diagonal D = {ai^(i) : 1 < z < n} into two parts, one part sums to roughly 0, and the other to roughly 
±1. That can only happen if almost all entries in D are small, and one is close in magnitude to 1. 



Part 2 (§ 2.5) 



This part uses induction on n to prove the following claim. If an n x n matrix satisfies property Q{S), 
namely a (1 — 5)-fraction of its generalised diagonals have a single entry which is large in magnitude, 
then the matrix has a strong line — either a row or a column, a (1 — C(5)-fraction of whose entries are 
large. (Here, C does not depend on n.) 

Base case. When n is small compared to 1/5, we can prove directly that there is a line where all of 
the entries are large. 

Induction step. Given an n x n matrix M satisfying Q{S) and a set X of n/2 rows, we can always 
find a set Y of n/2 columns such that either X xY ot X xY also satisfy Q{S). The induction hypothesis 
shows that the relevant submatrix has a strong line. The strong lines for different choices of X must be 
the same (on the same row or column of M), since otherwise the probability that a generalized diagonal 
passes through two large entries would be too big. Altogether, these strong lines constitute a line £ which 
is almost as strong as required. A small bootstrapping argument shows that £ must indeed have the 
required number of large entries. 



Culmination ( § 2.6 ) 

At this stage of the proof, we know that the matrix (a.y ) has a line, say row i, almost all of whose entries 
are close either to —1 or to 1. It follows that for most j, it holds that (n — !)!(/, Tij) is close to or to 1. 
The disjoint union of the 1-cosets corresponding to those entries close to 1 form a good approximation 
to T. 



Part 2 is largely independent of Part 1. Part 1 shows that most generalized diagonals of the matrix 
(aij) are composed of one large entry and n — 1 small entries. Part 2 abstracts this situation, and deduces 
the existence of a strong line. The results of Part 2 work for any definition of which entries are large and 
which are small, and so are of independent interest. 



Glossary of terminology 

defined in the end of 



Restrictions are defined in the beginning of § 2.2 



2.2 Good restrictions are defined in the beginning of § 2.5 



Typical restrictions are 
Functions which are 



almost Boolean or almost close to C are defined in |§ 2.2[ just before iLemma 6l Partitions, good partitions 
and good permutations are defined in the beginning of |§ 2.4[ Small and large entries are defined in the 
beginning of |§ 2.5[ Strong lines (as well as strong rows and columns) are defined in |§ 2.5[ just after 
ILemma 131 
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2 Proof when c = 1/2 
2.1 Matrix representation 

Let T and / be as in the statement of the theorem. Since /i e \J\, it can be represented as a Unear 
combination of 1-cosets Ty. We single out one such representation: 

ai, = (n-l)(/,T,,-). (3) 

We start by showing that the aij do indeed represent /i. 

Lemma 3. We have 

fi = dijTij. 

Furthermore, each row and each column of the matrix {atj) sums to zero: 

^ aij =0 Vi e [n], ^ ay =0 Vj e [n]. 

For eac/i permutation it G Sn, we have 

i 

Proof. The 'real' proof of this fact uses the Fourier inversion formula, and the characters of the first two 
irreducible representations of Sn- This is how we derived the formula. However, to avoid digression, we 
offer a simpler, ad hoc argument. 

The second statement follows from a simple calculation. For each i, 

5^(/,T,,-) = (/, 1) = 2{x:f, 1) - (1, 1) = 0, 

j 

so 

J2 a^J = 0. 

j 

Similarly, for each j, 

^{f,T,j) = if, 1)=0, 

i 

SO 

Y,aij =0. 

i 

For the first statement, both sides of the equation are in Ui, so it is enough to show that both sides 
have the same inner product with each T^. Note that {Tij,Tij) = 1/n, {Tij,Tki) = \i i ^ k and 

j ^ and {Tij,Tki) = if i ^ k or j ^ I. Therefore, 

{J2^kini,T,j) = ^ + ^^^^afc, = ^ + ^^^^ = i^x= = <h,Tio). 

k,l ^ I ki^i ^ ' 

using 

k^i k,l I k 

Finally, the formula for /i(7r) follows immediately from the first statement. □ 

The preceding lemma shows that each value of /i is equal to the sum of a generalised diagonal in the 
matrix (a^ ). 

Next, we calculate the i"^ norm of the vector formed by the entries Uij. 
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Lemma 4. We have 

Proof. Since /i is an orthogonal projection of /, we have 

I=ll/ll2=ll/l|l2+ll/-/l|l2 = ll/l|l2+ei- 

Therefore, H/iili = 1 ^ ^i- On the other hand, we have 

II/1II2 = X! ^ii°-ki{Tij,Tki) 



- a?- H — - ai^aki 



using [(4)] □ 



2.2 Random restrictions 

For X, F C [n] of equal size, let Tx,y denote the set of all permutations sending X to Y: 

Tx.Y = {tt e ^„ : 7r(X) = y)}. 

We call such a pair (X, F) a restriction. Let F) denote the subvector of /i supported on Tx,y- 

The final part of lLenima 31 shows that every value of fi is the sum of a generalized diagonal of (ay). 
It is natural to decompose g{X, Y) into two functions, one depending on the submatrix supported by 
X X Y, the other depending on the submatrix supported hy X x Y: 

gi{X,Y)^ J2 a,jT,„ g^iX ,Y) = g^{X J) . 

(In the definition of gi, Tij is, strictly speaking, the restriction of XTi- to Tx,y-) ILemma"3l immediatelv 
implies that g{X,Y) — gi{X,Y) + g2{X,Y). Note that gi{X^Y) and g2{X,Y) are both supported on 

Tx,Y- 

We now define a probability distribution TZ over the set of all restrictions, as follows. Each i G [n] is 
included in X independently at random with probability 1/2. Then, Y is chosen uniformly at random 
from all sets of size \X\. Note that this definition is symmetric between X and Y, and furthermore, 
{X,Y) has the same distribution as {X,Y). 

Most of this subsection will be devoted to the study of properties of typical restrictions. We start by 
calculating the mean and variance of V.[gi{X,Y)] when {X,Y) ^ TZ. This will enable us to show that 
E[gi(X, Y)] and E[g2{X, Y)] are typically small in magnitude. 

Lemma 5. Let X,Y C [n] with \X\ — \Y\, and define 

m(X,y)= E [gi{X,Y)], 

J X.Y 

where the expectation is with respect to the uniform probability measure on Tx.y- 
If {X, Y) ~ TZ, then the mean and variance of m{X, Y) with respect to TZ satisfy 

EtzH = 

and 

VkH < 2k- 
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Proof. We start with a formula for m{X, Y): 



Txy' \X\ 



Conditioned upon \X\, we have 

W.^\m(Y V\\\Y\] = 

\X\ 



En[m{X,Y)\\X\] = ^K^a,, = 0. 



2J 

Hence, E^ifm] = 0. 

The next step is to calculate E7j[m^]. Expanding the formula, we get 

\X\'^m(X, Yf = ^ l{i^x,j&Y]a% + E E l{ie^j,ieF}a»jaj/ 

i.j i j=il 

+ E E '^{i,keX,jeY}<^ijO-kj + E '^{i,k£X,j,l£Y}0'ijO'kl- 
j i^k k^i, 

Taking expectations, we get 

\X\^Ev\m{X, Yf\\X\\ = Fr[i eXAjeY]J2 4 + Pr[i e X A j,l e Y]J2J2 

+ Pr[i, fceXAjey]EE "^J^feJ + ^ e X A j, Z e y] E "fe'- 

Using [(4)1 together with 

E '^^^ ^ E '^'^J ^ 

k^i 

ffrom [Lemma 3|) we obtain 



\X\-1 



2Tn'_ r^/^i- '^z^2|| vll _ „2 ol^l^lv^„2 , (1^1 ^ 1)^ „2 

1 

2 



(n-l)2 ^ *J 

1,3 i.j '■■.j 



1 - 



n 



Taking expectations and using the estimate J2i j ^ij < n — 1 provided bv lLemma 4[ we conclude that 



t,3 

n + l 



< 



4:n{n - 1) 
1 



< 
- 2n 



□ 



The following lemma states some properties that a random restriction enjoys with probability close 
to 1. The lemma uses the following nomenclature for functions on a probability space (a.k.a. random 
variables): 

• A function (j) is {S, e)-almost Boolean if with probability at least 1 — <5, (pis e-close to ±1. In symbols, 

Pr[|(/)| e 1 ± e] > 1 - (5. 
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• A function (j) is [5,e)-almost close to C if with probabihty at least 1 — 5, 4> is e-close to C. In 
symbols, 

Pr[|0-C| <e] > 1-.5. 

1 /7 

Lemma 6. Let {X, Y) ^ TZ. With probability at least 1 — 3e^ , {X, Y) satisfies the following properties: 

(a) g{X,Y) is (e'^^'^ , e^''') -almost Boolean. 

(b) ¥\gi{X,Y)] and¥\g2{X,Y)] are -close to zero. 

(c) n{\g{X,Y)\~lf]<eT. 

1 /7 

Proof. We claim that each of the different parts holds with probability at least 1 — e^' . The lemma 
follows using a union bound. We will use the fact that selecting a random partition {X, Y) ^ TZ and then 
selecting a uniform random element in Tx,y is the same as choosing a uniform random permutation. 
This holds because for any X, the sets {Tx.y ■ \Y\ ~ \X\) partition S'„. 
We first deal with part (a). Suppose for a contradiction that 



Pr 

n 



Pr [\g{X,Y)\il±ey']>el 



4/7 



> e 



1/7 



It follows that Pr[|/i| ^ 1 ± el^^] > t-^^ This implies that 

mi - ff\>mh\-^f\> ^^Tf = 

This contradicts E[(/i — /)^] = ei, proving the claim for part (a). 

For part (b), we use lLcmma 5[ which gives the mean and variance (with respect to TZ) of E^^ ^ \g\(X, y)]. 
We have 

1 



Pr 

■R. 



E [5i(x,r)] 



> 



1/14 r- 



ei n _ 1 1/7 
- 2n ~ 2 1 ' 



.1/7 



El' > Cl 



1/14 



using Chebyshev's inequality. Assumption |(2)| states that ei > 1/n''/'^. Hence e^^^^ > 1/y/n, and so 



71. Therefore, 



Pr 

■R 



E [giiX,Y)] 



> e 



1/7 



1/7 



Since {X,Y) ^ TZ, the same holds for g2{X,Y). The claim for part (b) follows, using a union bound. 
For part (c), the starting point is 



Therefore, 



E[(|/i|-lf]<E[(/i-/f]=6i. 
E [i\g{X,Y)\-ir]\ =E[(|A|-in<ei. 



The claim now follows from Markov's inequality. 

We call a restriction typical if it satisfies the properties (a), (b) and (c) in lLemma 61 



□ 



2.3 Decomposition under a typical restriction 

In this subsection, we show that if {X, Y) is a typical restriction (meaning a restriction satisfying the 
properties listed in ILemma 61) . then the functions gi{X,Y) and g2{X,Y) have a particularly simple 
structure: up to translation, one of them is almost constant, and the other is almost Boolean. 

Many of the lemmas in this subsection start by assuming that a particular restriction {X, Y) is typical. 
In these lemmas, we will write 5,51,(72 for g{X,Y), gi{X,Y), g2{X,Y). 

The following technical lemma tackles the following situation. Suppose that some function <j) is almost 
close to Cq. Can we deduce that Co ~ E0? The lemma gives a sufficient condition (in the case Co = 0). 
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Lemma 7. Suppose that a function (p on a probability space satisfies the following properties: 

(a) The function (/) is {p, e)-almost close to 0. 

(b) There exists C G M such that E[(|0 + C| - 1)^] < 5. 
Then 

\H4>]\ < 3e + 3p + 6j— 

Remark. Condition (b) says that (j)~\-C is almost Boolean in the L? sense. As we shall see, conditions 
(a) and (b) together imply that C must be close to 1 or close to —1. 

Proof. Without loss of generality, we may assume that C > 0. We start by establishing the bound 



|1-C|<.+ ^^. (5) 

We distinguish between three cases: C<l~e, C>l + e and |1 — C| < e. In the latter case, we already 
have the desired bound. 

Suppose C < 1 — e. Whenever |(/)| < e, we have 

1-10 + CI > l-C-e>0. 

Since this happens with probability at least 1 — p, we deduce that (1 — p)(l — C — e)^ < S, verifying |(5)[ 
Suppose next that C > 1 + e. Whenever < e, we have 

|</' + C|-l>C-e-l>0. 

Since this happens with probability at least 1 — p, we deduce that {1 — p){C — e — 1)^ < 6, again 
verifying |(5)[ This completes the proof of |(5)[ 
When \t\ > 1, we have \t\ < t'^, and so 

E[||0 + C\- l|x{0+c>2}] < IE[(|0 + C\- l)'x{0+c>2}] < S. 
The triangle inequality implies that 

|E[(0 + C-l)x{0+c>2}]| <IE[|'^ + ^^-l|xw+c>2}] =]E[||'^ + ^^|-l|xw+c>2}] 
When t < -2, we have \t - 1\ = I - t < 3(-t - 1) = 3(|t| - 1) = 3||i| - 1|, and so, as before, 

E[|^ + C~ l|x{0+c<-2}] < 3E[||0 + C\- l|x{0+c<-2}] < 3<5. 
The triangle inequality implies that 

|E[(0 + C-l)x{0+c<-2}]| <3<5. 
Combining the two together, we get 

|E[(0 + C-l)x{|0+c|>2}]| <4,5. 
Define ijj — 4> + C — 1. Rewriting the last inequality in terms of ip, we have 

|IE[V^X{|^+1|>2}]| < 4(5. 

When IV' + 1| < 2, IV^I < 3. When 101 < e, we have IV-I < |1 - C'| + e < 2e + ^5/{l-p). Therefore 

I EMI < |lE[V'X{|0|<e}]| + \m'X{\<P\>e and |V+1|<2}]| + |lE[V'X{|iA+l|>2}] | 



< 2e+ J— ^ +3p + 45. 
V 1 



We conclude that 



<\'E[iP]\ + \l-C\<Se + 2j-^+3p + AS. □ 

V 1 
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Out first key step is the following lemma, which uses the fact that gi {X, Y) and 52 {X, Y) are inde- 
pendent pieces of g{X, Y) to deduce that, up to translation, both are close to Boolean. Moreover, at 
least one of them is close to being constant. 

We will use the following notation, when a restriction {X, Y) is understood. For a permutation it, 
TTi = 7r\x denotes its restriction to X, and 712 = 7r|^ denotes its restriction to X. Thus, gi depends only 
upon TTi, and 52 depends only upon n2- 

Lemma 8. Suppose {X,Y) is a typical restriction. Choose a, (3 uniformly at random from Tx,y- Then 
with probability at least 1 — 8e^^^, one of the following three cases holds: 

1/7 

(a) |,9i(ai) - .9i(/3i)| and \g2{a2) -52(/32)| < 2e/ . 

(b) Iffi(ai) - < 2eY' and \g2{a2) - 52(/32)| G 2 ± 2el^\ 

(c) |5i(ai) - e 2 ± 2e\'' and \g2{a2) - 52(^2)] < 2e\'\ 

Proof. Typicality implies that Pr[||g'| — 1| > e^^^] < e^^"^ . This implies that with probability at least 

1 — e-^ over the choice of tti, it is true that Pr7r2 [||.9('''')l ^ 1| > ] < • So with probability at least 

2/7 1/7 
1 — 2e-^ over the choice of (tti, 7r2), it is true that |5(7r)| is t-^ -close to 1. Thus with probability at least 

2/7 1/7 

1 — SC]^ over the choice of a, /3, all of the following are e{ -close in magnitude to 1: 

X = .91 (ai) -l-.g2(a2), V = .91 (ai) + .92(/32), 

z = +52(a2), w = 51 (/3i) + 32(^2)- 

Since ei is small enough, each of these four values is unambiguously close to either 1 or —1. 
If x,y,z are all close to the same value, then 

|5i(ai) - 5i(/3i)|, |fl2(a2) - 52(/32)| < 2e}/^. 

If X and y are close to different values, then 

I52M - 52(^2)1 e2±2el/^ 

Without loss of generality, we may assume that x is close to 1 and y is close to —1. Then g2{c(2) — g2{li2) € 

2 ± 2e\^^ . If z is close to —1, then 51(0:1) — giiPi) € 2 ± 2eJ^^. But this implies that 

w = x- {gi{ai) - giiPi)) - {g2{a2) - g2{p2)) e -3 ± 56^'. 
1/7 1/7 

Since ei is small enough, —3 + 5e-^ < —1 — , and we reach a contradiction. So when x and y are 
close to different values, x and z must be close to the same value. This implies that 

|5i(ai)-5i(/3i)|<2e}^ 
If X and z are close to different values, then we similarly obtain 

|<?i(ai) - e 2 ± 2el^\ 152(02) - g2{p2)\ < 'le\\ 

These cases are exhaustive. □ 

The preceding lemma shows that for most choices of a, /3, either both g\ and 52 act as if they were 
constant, or one acts as if it were constant, and the other acts as if it were Boolean, up to translation. 
The following lemma, which is the main result of this section, shows that in fact, one is almost zero, and 
the other is almost Boolean. 

Lemma 9. Suppose {X,Y) are typical restrictions. Then either gi is (SeY"^ ,19ey^)-almost close to zero 

1/7 1 /7 

and (?2 is {Ae-^ , 24ej^ )-almost Boolean, or the same is true with the roles of gi and 52 reversed. 
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Proof. Define 



a.p^i x.Y 

P2= Pr [\g2{a2) - 92^2)1 > 2ey']. 

a.pGi x,Y 

ILemma 8l implies that piP2 < 8ej^ . Thus, either pi < or p2 < Se^^ . Without loss of generality, 

1 /7 

we may assume that pi < . 

A simple averaging argument shows that for some choice of a, we have 

Pr [\g,{a,)-g,{(3,)\<2el/']>l-3eY\ 

P£J-X,Y 

Therefore, putting Ci ~ gi{ai), we deduce that gi is (3eJ^'', 2eJ^'')-almost close to Ci. 

Typicality implies that 5 = 51 +52 satisfies &i{\g\ — 1)'^] < ■ This must be true for some value C2 
of g2- The function gi — Ci is (SeJ^^, 2eJ^^)-almost close to zero, and so we can applv iLemma 7l with 
the following parameters: 

cl>^g,~Cu P^ie\'\ e^2e\'\ 5 ^ C^C, + C2. 

Since ei is small enough. ILemma~7l implies that 

\n9i]-Ci\<Qe\" + ^e\" + & 



On the other hand, by typicaUty, < e-^ . Therefore 

|Ci| < l7el/\ 

1/7 1 /7 

We conclude that gi is (3e;^ , 19e^' )-almost close to zero. 

We now turn our gaze to 32 • ILemma"8l implies that with probability at least 1 — 8eJ^^ over the choice 

1/7 

of a, 13, it holds that |.g2(a2) — 32(/32)| G {Ot^,} ±2€i . A simple averaging argument shows that for some 
choice of a, it holds that 

Pr [|52(«2) - 92m\ i {0, 2} ± < 8e'/\ 

P&J-X,Y 

Let C3 — 92{ct2)- Then 92 is concentrated on the three values {C3 — 2, C3, C3 + 2}. Another application 
of ILemma 81 will show that it is actually concentrated either on {C3 — 2, C3} or on {C3, C3 + 2}. Define 

91= Pr [92meC3 + 2±2ey\ 

q2= Pr [32(72) eC3-2±2ey']. 

When 92{(32) £ C3 + 2 ± 2t]''^ and 52(72) e C3 - 2 ± 2t^\ we have 1^2(^2) - 52(72)! e 4 ± ^t^\ In 

particular, since ei is small enough, in this case /32, 72 satisfy none of the options presented bv ILemma 81 

2/7 1/7 1 li 

Hence, we must have q\q2 < Se^ . Therefore, either qi < Se^' or q2 < 3e^' . Without loss of generality, 

1/7 1/71/7 

we may assume that qi < . Putting C4 = C3 — 1, we conclude that 92 — C4 is {Aei , 2t-( )-almost 

1/7 2/7 1/7 

Boolean. (Here, we used the estimate Se^ + 865^ < 46^ , true since t\ is small enough.) Our task is 
now to show that C4 is close to zero. 

\l1 1/7 \l1 1/7 

Since 51 is (36]^ ,196]^ )-almost close to zero, it follows that 5 — C4 is (Tcj^ ,2\e^ )-almost Boolean. 

By typicality, 9 is (e^^^, ey^)-almost Boolean. Therefore, with probability at least 1 — Te^^ — e^^^ > 
1/7 

1 — 86;^' over the choice of tt e Tx.y, 

5(7r) e {C4 ± 1} ± 216^^ and ^(Tr) e {±1} ± e]''^. (6) 
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Suppose that 7r+ € Tx,y satisfies [(6)| with g{Tr+) e\ -close to 1. Then either C4 is 22e}' -close to zero 



or it is 22ey^-close to 2. Similarly, if 7r_ £ Tx,y satisfies [(6) [with g{n-) '-close to —1, then either C4 



1/7 1/7 
is Tle^ -close to zero, or it is Tlt^ -close to - 

1/7 



1/7 



-2. Since t\ is small enough, if such permutations tt-j 



exist, then we can conclude that IC4I < 22ej^' . That would complete the proof of the lemma. 



It remains to rule out the case that for all permutations satisfying |(6)[ 5(71) has the same sign. That 
would imply that g is (SeJ^^, eJ^^)-almost close to L, where L £ {±1}- We apply [Lemma 7[ with the 
following parameters: 



A A 

= g-c, _p = 



1/7 



.1/7 



r A 6/7 



Since ei is small enough, the lemma implies that 



mg] 



L\<3el/' 



2Ae{ 



1/7 _ 



6/7 



\l-9el 



1/7 



Therefore E[g] is 28ey^-close to L. On the other hand, typicality implies that E[g] is 2e}' '-close to 

close to zero. Since L S {±1} and ei is small enough, this is a 



1/7 



zero. We deduce that L is SOe^ 
contradiction. 



1/7 



□ 



2.4 Random partitions 

Subsection 12.31 deals with random restrictions {X,Y). The main result. lEemma 9[ shows that with large 
probability, in the decomposition g{X, Y) = gi{X, Y) + g2{X, Y), one of the functions is almost constant, 
and the other is almost Boolean. In this subsection, we switch the order of the random choices, and 
deduce a property of random permutations. 

We will need the following classical theorem due to Esseen 0. For a modern proof, see [TH 4.1.b]. 
Note we require Esseen's version, rather than Berry's slightly weaker result [2]. 

Theorem 3 (Berry- Esseen). Let Xi, . . . , X„ be independent random variables with finite third moments, 
and let S be their sum. Define 

riiWu^Kxii 

Let N be a normal random variable with the same mean and variance as S . Then S and N are C^ip-close 
in distribution, where Cq < I is an absolute constant. Ln other words, for every t € R, 

|Pr[S' < t]-Fr[N < t] \ < Coip. 

Before stating the results, we need some definitions. If {X, Y) ^ TZ, then the marginal distribution 
of X is [/(2["1), the uniform distribution on the power set of [n]. With slight abuse of terminology, we 
call the subset X C [n] a partition, as it will correspond to the genuine partition {X,X'^). 

For a permutation tt G 5„ and a partition X G [n], define 

Pi ^ ai7r(i), P2 ^ aj7r(i)- 

1/7 1 /7 

We say that a partition X is good for tt if either Pi is 25ei -close to zero and P2 is 25ej^' -close to ±1, 
or the same is true with the roles of Pi and P2 reversed. Otherwise, we say that X is bad for tt. We say 
that the permutation tt € Sn is good if with probability at least 4/5, a random partition X is good for 
TT. Otherwise, we say that tt is bad. 

The following lemma shows that most permutations are good. 

1 /7 

Lemma 10. With probability at least 1 — SOe^ , a random permutation tt G Sn is good. 
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Proof. Bv lLcmma 61 a restriction {X, Y) TZis typical with probability at least 1— 3e}''^. Suppose {X, Y) 
is typical. Choose a uniform random permutation tt G Tx,y- ILemma"9] shows that with probability at 

1/7 

least 1 — 7ei , X is good for tt. Hence, if we choose a restriction {X, Y) ^ TZ and a permutation tt S Tx,f 

1 /7 

uniformly at random, then X is good for tt with probability at least 1 — lOej^ . 

Given X, the sets Tx^y partition Sn- Therefore, the permutation tt chosen in the process above 
is chosen uniformly at random from Sn- Furthermore, by definition, the marginal distribution of X is 
[/(2["1). Therefore, if we first choose a permutation tt G Sn uniformly at random, and then we choose 
X ^ C/(2["l), then X is good for tt with probability at least 1 — lOe}^^. Thus, the average probability 

1/7 

(over TT G Sn) that a random partition is bad is at most lOej^ : 



E 

irGS„ 



Pr [X is bad for tt] 



< 106}/^ 



1/7 

Markov's inequality now implies that the probability that tt is bad is at most 50e^ : 



Pr 



Pr [X is bad for ttI > 1/5 



□ 



The next lemma shows that if tt is a good permutation, then the generalized diagonal ai7r(i) corre- 
sponding to TT has a special structure: one of its elements is 'large', and the rest are 'small'. This is, 
essentially, a consequence of the main statement of [11], but, for the sake of being self-contained, we give 
a full proof. 

1/7 

Lemma 11. Suppose tt G Sn is a good permutation. Then for some m G [n], \ann,(m)\ is SOej^ -close 

1/7 

to ±1, and for i^m, |aj^(j)| < bOe^' . 

Proof. The proof is inspired by one of the proofs in [llj . Considering what happens when an element 
'switches sides' allows us to group the elements ai^(j) into two groups: 'small' elements (close to zero) 
and 'large' elements (close to ±1). Similar considerations show that there can be at most one large 
element. The crucial part is showing that not all elements can be small. Indeed, in this case, the sum 
Pi in the definition of goodness is approximately normal, and so it cannot be concentrated on the two 
values {0, 1} or {0, —1}. The formal proof is as follows. 

Define Si = ai^^i^iy Since tt is good, Sq :~ X^iLi SOe^^-close to either 1 or —1. It cannot be close 

1/7 

to both, since ei is small enough. Choose K G {±1} so that 5*0 is SOej^ -close to K. 

Define T{X) = Y^i^x Let X - f7(2["l), and put T = T{X), so that T is also a random variable. 
Note that T — Sq/2 + Y!,7=i where Wi — Si{xiex — 1/2). Clearly, X is a good partition for tt if and 
only if X is a good partition for tt. Since X and X are equidistributed, given that X is good for tt, T is 
25e}^^-close to zero with probability 1/2, and 25eJ^^-close to K with probability 1/2. We conclude that 

1/7 1/7 

with probability at least 2/5, T is 25ej^ -close to zero, and with probability at least 2/5, T is 25ej^ -close 
to K. 

Consider any Si. Since 2 • 1/5 < 1 (here, 1/5 is an upper bound on the probability that a random 
partition is bad for vr), there is some choice of F C [n] \ {i} such that both Y and Y U {i} are good for 
TT. Since \T{Y) — T{YU {i})\ = \si\, necessarily either \si\ < 50e}^'' {si is small) or \si\ is 50e}^''-close to 
\K\ = 1 {si is large). 

1 /7 

We claim that not all the Si can be small. Assume, for the sake of contradiction, that \si\ < 50e^ 
for all i. Applying Berry-Esseen with Xi = Wi shows that T is ■(/'-close in distribution to a normal 
distribution N ^ Af{So/2,a'^), where 



„2 „/, _ Z^j=l 



|3 



4^ ^ (Elli^f)^/^' 
3 ^ 9 



The upper bound on \si\ implies that \si\ < 50ei sf, and so 
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1/7 

We now obtain a lower bound on a. With probability at least 4/5, T is 256^ -close to zero or to and 

1/7 

so its distance from its mean 5*0/2 is at least 1/2 — SOe^^ . Hence 

1/7 

This shows that ijj — 0{1). Concretely, when e-^ is smaU enough, ■0 < 1/10. 

For every interval /, Berry-Esseen shows that | Pr[r e /] — Pr[A^ € /]| < 2ip. We consider three 

1/7 1/7 

intervals: /i = ± 2be^ , I2 — K zL 25e^ , and I3 is the interval 'in between': when K — 1, I3 — 
(25e}/^ 1 - 25eJ^^), and when K = -1, I3 ^ (-1 + 25e}''^ -25eJ/^). By assumption, 

Pr[r e /i] > 2/5, Pr[T e /a] > 2/5, Pr[r G /g] < 1/5. 



Since 2ip < 1/5, we deduce that 



Pr[iV e /i] > 1/5, Pr[Af e /2] > 1/5, Pr[iV e /g] < 2/5. 

The density of a normal distribution is bitonic (increasing and then decreasing), and so 

We have = I/2I = 506^^ and I/3I = 1 — 506^''. Therefore, the left-hand side of |(7)| is at most (say) 

4/5, and both terms on the right-hand side are at least (say) 1 (since ei is small enough), a contradiction. 

1/7 

Concluding, there must be some m such that |s„i| is 25e^' -close to 1. We claim that there cannot be 

1/7 

two such indices m, I. Suppose, for the sake of contradiction, that both \sm\ and |s;| are 25€^ -close to 1. 
Since 4 • 1/5 < 1, there is some choice of F C [n] \ {to, 1} such that all of F, F U {to}, Y U {I}, Y U {m, /} 
are good for vr. Since T{Y U {to}) = T{Y) + s,n and T{Y U {/}) = T{Y) + si, we see that s„, si must 
have the same sign. Since T{Y U {to, I}) — T{Y) + Sm + si, this implies that Y and Y U {m, /} cannot 
both be good. This contradiction shows that there can be at most one large s,n- □ 



2.5 Strong lines 

The previous section showed that if we pick a generalized diagonal at random in the matrix (aij ) , then 
with probability close to 1, we can designate exactly one element in it as 'large'. Corollary [T^ restates 
this formally. 

1 /7 

In this subsection, we say that an entry aij is large if |a.y | is 506^ close to 1. Otherwise, we say it 
is small. Note that, contrary to the usage in lLemma 11[ small elements need not be close to 0. While 
ILemma TT] allows us to deduce that most of the non-large elements in (fly ) are actually close to 0, for 
what follows, it will be enough for us to just maintain a distinction between large elements and non-large 
elements. 

Let {X, Y) be a restriction. Denote by A[X, Y] the submatrix {aij)i^x,jeY ■ We say that a generalized 
diagonal in F] is good if it contains exactly one large entry. We say that {X,Y) is q-good if with 
probability at least 1 ~ q, a, random generalized diagonal in A[X, Y] is good. 

1/7 

Corollary 12. The restriction {[n], [n]) is bOei -good. 

Proof. Immediate from lLcmma lOl and ILemma 111 □ 

1/7 

Our goal is to deduce that {[n], [n]) has a row or column which contains (1 — 0{e-^' ))n large entries. 
The general plan of attack is to prove this by induction on n. We will have a separate argument for 
small values of n, and an inductive argument for large n. The latter will use the following lemma, which 
we will apply with \X'\ = L|^|/2J. 

Lemma 13. Suppose that {X,Y) is q-good for q < 1/2. For every X' C X there exists F' C F with 
\Y'\ = \X'\, such that either {X' , Y') or {X \X',Y\ Y') is q-good. Similarly, for every Y' C Y there 
exists X' C X with \X'\ = \Y'\, such that either {X',Y') or {X \X',Y\ F') is q-good. 
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Proof. By symmetry, we need only prove the first statement. Fix X' C X. Since the sets {Tx'.y ■ Y' C 
Y, \Y'\ = \X'\) partition Tx,y, the fact that {X,Y) is g-good implies that for some choice of Y' , the 
probability that a random generalized diagonal corresponding to a permutation in Tx'^y' is good is at 
least 1 — q. Choose such a Y' . 

Let px be the probability that a random generalized diagonal in is good, and let p2 be the 

probability that a random generalized diagonal in A is good. Thenpi(l— p2) + (l— Pi)P2 > 
1 — g. Wc claim that this forces max(]3i,|52) > 1 ^ 

Indeed, let pi = (1 + 5x)/2 and = (1 + '52)/2, where \5i\, \52\ < 1. We have 

Pi{l-P2) + {l-pi)P2 = ^ • 



Since 1 — g' > 1/2, we must have 8162 < 0. Without loss of generality, we may assume that Si > 0. Since 
1 > —S2, we have 

1 + 61 ^ 1 - S162 ^ , 
Pi = ^->^— >l-9, 

proving the lemma. □ 

Let {X,Y) be a restriction. If i G X, we say that row i is p-strong for {X,Y) if at least (1 — p)\Y\ 
of the entries {uij : j £ Y} are large. If j G Y, we say that column j is p-strong for {X, Y) if at least 
(1 — of the entries {uij : i & X} are large. 

Wc say that {X,Y) has a p-strong row (resp. column) if some row (resp. column) is p-strong for 
{X, Y). We say that {X, Y) has a p-strong line if it has either a p-strong row or a p-strong column. 

Our goal is to show that ([n], [n]) has a strong line. We start by showing that two strong lines must 
coincide. 

Lemma 14. Suppose that {X,Y) is q-good. Let Xi,X2 C X, and let Yi,Y2 C Y with \Yi\ = \Xi\ 
and \Y2\ = \X2\- Suppose that {Xi,Yi) has a pi -strong line, and that {X2,Y2) has a p2- strong line. If 
(l-pi)|Xi| > 1, (l-p2)|X2| > 1 and 

then the strong lines must he the same (defined by the same row or by the same column). 

Proof. Suppose, for the sake of contradiction, that the two restrictions have different strong lines. Let 

Li C XixYi consist of the first ti = \{1 —pi)\Xi |] indices of large elements in the strong line of {Xi,Yi), 
and let L2 consist of the first t2 = \{1 — P2)|-''^2n indices of large elements in the strong line of {X2, Y2). 

Say that (ii, ji) G Li and {12,32) S L2 conflict if either i\ = 12 or ji = j2 (or both). If Li is row i 
and L2 is column 7, then an entry on Li not on column j never conflicts with an entry on L2 not on 
row i. Therefore, there are at least {ti — l){t2 — 1) non-conflicting pairs. If both Li and L2 are rows 
(resp. columns), then two entries conflict only if they are on the same column (resp. row). Therefore, 
the number of non-conflicting pairs is at least tit2 — min(ti,t2) > (^i — 1)(^2 — !)• 

For each non-conflicting pair, the probability that a random generalized diagonal in A[X, Y] goes 
through both entries of the pair is 1/|X|(|X| — 1). Since these events are all disjoint, it follows that 

(tl - l){t2 - 1) jtl - l){t2 - 1) 

|xp ^ \x\i\x\-i) 

Since ti,t2 > 2, — 1 > ti/2 and f2 — 1 > ^2/2. Using > 1 — pi and t2/\X2\ > 1 — P2, we deduce 

that 

(1-Pi)(l-P2)^^<4,, 
contradicting our assumption. □ 

Note that the conditions (1 — pi)|Xi| > 1 and (1 — P2)|-'^2| > 1 simply guarantee that each strong 
line has at least two large elements. (If one of the strong lines had only one large element, then it could 
be contained in the other strong line, and so there would be no contradiction.) 

Our next result says that if there is one strong line, then there cannot be many large entries outside 
the line. For the proof, we need the simplest case of Bonferroni's inequality. 
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Theorem (Bonfcrroni). Let Ai, . . . , Ah be events. Then 

Pi-[Ai V • • • V Ah] > Pr[Aj] - Pt[A, A Aj]. 

i i<j 

Lemma 15. Suppose that{X,Y) is q- good and has a p- strong line. Let m = \X\, and let g ^ 2q/{l—p). 
//m > 6, (1 — J3)m > 1, 2g'm > 1 and g < 1/2, then that line is actually (q + 3 g) -.strong. 

Proof. Without loss of generality, we may assume that the p-strong line is row i. Since {X, Y) is g-good, 
a random element in F] is large with probability at least (1 — q)/m. Therefore, v4[X, F] contains 
at least (1 — q)m large entries. 

Suppose that row i is not {q + 3p)-strong. Then ^[X, y] contains at least ?>gm large entries outside 
row i. ILemma 141 implies that no other line can be (1 — 2p)-strong. Therefore, no column can contain 
more than 2gm large entries. So for any column yl[X, F] contains at least gm large entries outside 
row i and column j. The probability that a random generalized diagonal in A[X, Y] hits any single one 
of these, given that it hits a specific large entry in row i, is l/(m— 1), and the probability that it hits any 
two of them is at most l/(m— l)(m — 2). Therefore, Bonferroni's inequality implies that the probability 
that a generalized diagonal in A[X, Y] contains at least two large elements is at least 

, gm [gm-\-l){gm) \ _ {l-p)gm / gm+l \ 

^ ~^'\^^i^~2{m-l){m-2))^ m-1 \ ^ 2(m - 2) j ^ ^' 

since gm + 1 < m/2 + 1 < to — 2. But this probability must be at most q, a contradiction. □ 

The following sequence of lemmas shows the existence of a strong line in {X,Y), given that {X,Y) 
is g-good for q sufficiently small depending on \X\. 

Lemma 16. Suppose that [X^Y) is q- good for some q < ; where m = \X\. Then (X,Y) has a 

Q-strong line. 

Proof. Without loss of generality, we may assume that X = Y = [to]. Since q < 1/to, there must exist 
a permutation tt G 5*^, such that all generalized diagonals of the form {(z, 7r(i) + j) : j G [m]} are good. 
Without loss of generality, we may assume that tt is the identity, and that ai_i is large. For j £ [m], let 
a.i^ij^^j be the large entry on the corresponding diagonal. If i ^ 1 and i+ j ^ 1, then a random generalized 
diagonal in ^[[m], [m]] passes through both ai^i and Qi^i+j with probability \/{m[m — 1)), contrary to 
our assumption. Therefore, all large entries are either on row 1 or on column 1. If there is an entry a^^i 
not on row 1 and an entry ai j not on column 1, then we again reach a contradiction. It follows that 
either row 1 or column 1 consists of large elements only. □ 

We now improve this result using induction. 

Lemma 17. Suppose that {X^Y) is q-good for some q < where m = \X\. Then {X,Y) has a 

0-strong line. 

Proof. The proof is by induction on to. When m < 5, the claim follows from ILemma 16[ so suppose that 
TO > 6. 

Let X' C X he an arbitrary subset of size s = [to/2J. ILemma T3l shows that there exists Y' C Y 
with \Y'\ = \X'l such that either (X'^Y') is g-good or {X \X',Y \ Y') is g-good. The induction 
hypothesis implies that either {X' , Y') or {X \ X' . Y \ Y') has a 0-strong line. Similarly, \iY' <zY is an 
arbitrary subset of size s = [to,/2J, then there exists X' <Z X with \X'\ = \Y'\ such that either {X' ,Y') 
or {X \X',Y\ Y') has a 0-strong fine. 

Since s > 3, we have 

/ s \2 9 1 1 
Vm/ 49 to 

so ILemma 141 implies that all 0-strong lines arising from different choices of X' or Y' must be defined by 
the same row or column — say row i. 

We claim that row i can have at most one small entry. If row i has at least two small entries aij , Oik , 
then there exists Y' cY with \Y\ = s, such that j e Y' and keY\Y'. For any X' C X with \X'\ = s, 
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row i can be 0-strong in neither {X', Y') nor {X \ X', Y \ Y'), a contradiction. Thus, row i has at most 
one small entry. 

Suppose that row i has exactly one small entry. Since q < \/Am < 1/m, a random entry in A[X, Y] 
is large with probability at least {1 — q)/m > 1/m — 1/m^, and so there must be at least m large entries. 
Exactly m — 1 of these are on row i. Let aki be another large entry. The probability that a random 
generalized diagonal passes through both aki and one of the large entries on row i is at least 

— (l 1 A 4/5 1 

m \ m — 1 J ~ m Am ' 

a contradiction. Hence, row i must be 0-strong for A[X, Y], completing the proof. □ 

We now use induction to tackle the case of large \X\. 

Lemma 18. Suppose that {X^Y) is q-good for q < 1/50. Then {X,Y) has a liq-strong line. 

Proof. The proof is by induction on m :— \X\. When m < l/{Aq), the statement of the lemma follows 
from lLemma 17[ so suppose that m > l/(4q) > 12. 

Let X' C X he an arbitrary subset of size s — [m/2j > 6. ILemma T3l shows that there exists Y' C Y 
with \Y'\ — \X'\, such that either {X',Y') is g-good or {X',Y') is q-good. The induction hypothesis 
implies that either {X', Y') or {X \ X', Y \ Y') has a 13(7-strong line. Similarly, ilY'cY is an arbitrary 
subset of size s = [m/2j, then there exists X' C X with \X'\ = \Y'\ such that either {X',Y') or 
{X\X',Y\ Y') has a ISq-strong line. 

Since (1 — 13q)s > 1 and (1 — 13q)^{s/m)^ > Aq, ILemma Ml implies that all the 13q-strong lines 
arising from different choices of X' or Y' must be defined by the same row or column — say row i. 

We claim that row i has at most [13gm + Ij small entries. Indeed, suppose it has at least [IBqm + 2J 
small entries. Then there exists a subset Y' C Y with \Y'\ = s, such that the F'-part of row i contains at 
least [13qs-|-lJ small entries and the y\F'-part of it contains at least [13g(m — s) -I- IJ small entries. For 
any X' C X, row i is not 13g-strong in either {X' , Y') nor {X \ X', Y \ Y'), a contradiction. Therefore, 
row i is a (13(7 + l/»77,)-strong line for (X, Y). 

Let p — 13q + 1/m. We are going to apply iLemma 15l Clearly m > 12, and it is easy to check that 
(1 — p)m > 1 and g < 1/2. Slightly more delicately, we have 

Aqm 

= -; 75 TT- > > 1- 

1 — liq — 1/m 

Hence, bv lLemma 151 row i is (q + 3f3)-strong. Since p < 1/2, we have q + 3g < 13q, so row i is IBg-strong, 
completing the proof. □ 

It might seem that the condition 2gm > 1 is very tight. This will not matter for us, but in fact, one 
can prove a version of ILemma 151 with a weaker condition, at the cost of obtaining a worse guarantee on 
the strength of the line. 

1 /7 

Corollary 19. There exists an 0{ei )-strong line for {[n], [n]). 

Proof. Follows immediately from [Corollary 12| and [Lemma 181 □ 
2.6 Culmination of the proof 

In this section, we will see what [Corollary 19| implies in terms of the original family J-. Without loss 
of generality, we may assume for the rest of this section that the ©(e^ )-strong line whose existence is 
guaranteed by [Corollary 19[ is row 1 . 

What the corollary implicitly says is that the matrix (aij) looks very like the canonical example 
shown in the introduction: 



... i_i i_i ... 



1 



I _i ... _i 1 ... 1 I 

\ n n n n / 
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Indeed, the corohary shows that, without loss of generality, the first row consists mainly of elements 
which are very close to ±1. Since the line must sum to zero, we know that roughly half of these are 
close to 1, and roughly half to —1. This information will enable us to deduce that T is close to a disjoint 
union of roughly njl cosets. 
For J, J S [n], we define 



By |(3)[ we have 



(n-l)! 



In 

(8) 



2 2(n-l) 

So we have Ty w (0^ + l)/2. More precisely, we have the following. 

1/7 

Lemma 20. Each is 1/n-close to {uij + l)/2. If aij is large, then Tij is 26e^ -close to {0, 1}. 



Proof. The formula (3) for implies that \aij\ < 1. We have 



2 ' 2(n-l)' 

The second term has absolute value at most < -, since n> 2. 

2{n—l) — n ' — 
1/7 1/7 

A large entry is bOe^ close in magnitude to ±1, and so {aij + l)/2 is 25ei -close to {0, 1}. Finally, 

I I 1/7 

assumption (2) implies that 1/n < e-^ . □ 
We are now almost ready to prove our main result. 

1/7 1 /7 

Lemma 21. The number of tu which are 26ei -close to 1 is 0{ei )n-close to n/2. 

1/7 

Proof. Let A^'o be the number of tu which are 26e^ -close to 0, and let iVi be the number of tu which 
are 26ey^-close to 1. ILemma 3] shows that 

n 
i=l 

On the other hand, 

(1 - 26eY'^)Ni <T< 266^/^0 + {n - Nq) ^ n - {I ~ 26eY'^)NQ. 
Substituting T — n/2, we obtain 

No,Ni < (l + 0(e;/'))n/2. 

1 /7 1/7 

The fact that row 1 is an 0{€i )-strong line implies that A^'o + A^i = (1 — 0{ei ))n, and so 

A^i > (1 - 0(ey^))n - A^o > (1 - 0{el^'^))n/2. □ 
The main result easily follows. 
Corollary 22. Suppose that n > 4, and 

1 

< ei < eo, 



,7/3 



where eo > is an absolute constant. Let J- C Sn be a family of permutations with size \J-\ — n\/2, 
satisfying 

IE[(/-/i)'] -ei, 

where f — 2xt ~ Ij fi is the orthogonal projection of f onto Ui . There exists a family Q C Sn which 

1/7 

is a union of dn disjoint 1-cosets, where d is 0{t^ )-close to 1/2, satisfying 



\gAT\ < Oie{ 
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Proof. Lemma [5T] implies that the set 5' = {i : ti^ > 1 — 26eJ^''} has cardinahty which is 0(ey^)n-close 
to n/2. Define 

Note that the Tu are pairwise disjoint, and so \Q\ = (n — 1)!|S'|. By the definition of S, we have 

\^ng\>{l- 26el^'){n - 1)!|5| = (i - OieY'))n\. 

It foUows that 

\TAg\ = \T\ + \g\-2\Tng\ 

< n\/2 + (1 + 0(e}/'))n!/2 - 2(i - 0{ey^ j)n\ 

3 Proof in the general case 

Up until now, we have only discussed the case — nl/2. Much of the argument remains intact for 
general values of c = | J^|/n!, although an additional argument is required in the culmination of the proof. 
Also, eo will now depend upon c. More concretely, let 

r/ = min{c, 1 — c}. (9) 

As we shall see below, for the proof to go through, we will need eo — 0{Tf). Indeed, we shall make the 
following assumption. 

^ < ei < co77^ 



nV3 

To explore all of these issues, let us follow the existing proof and see how it adapts for arbitrary c. 



Matrix representation ( § 2.1 ) The coefficients aij for c = 1/2 were defined so that the following 
holds. 

/i=^a.,T,,. (10) 

As we remarked in the proof of ILemma"3l our definition of a.^ can be derived from this formula via 
Fourier inversion. For arbitrary c, the corresponding definition is 

a., =(n-l)(/,r,,) (2c -1). m 



Under this definition, (10) holds. Straightforward calculations yield the following updated versions of 
ILemma"3l and ILcmma 4l 

Lemma [3f. We have 

fi = ciijTij. 

Furthermore, 

aij — 2c — 1 Vi e [n] , aij — 2c~ 1 V j £ [n] . 

3 i 

For each permutation tt, we have: 

/i(7i") = X!""W- 

■i 

Lemma [4f. We have 

Y,al^{n-l){l-e,)-{n~2){2c-l)\ 

i,3 
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Random restrictions ( |§ 2.2^ The proof of lLemma 51 becomes more cumbersome. Curiously enough, 
the variance of the random variable in question is actually maximized when c = 1/2, and so the bound 
on the variance holds true for arbitrary c. Here is the updated version. 

Lemma Hf. If {X,Y) is a restriction, define m{X,Y) = Et^^ y)]. Let iX,Y) - TZ. Then 

EkH = c - 1/2 andYnim] < l/2n. 

Proof. We only give the exact formula for Vk [to] : 

r 1 "+1 (2c-l)^ 1 ^ 

The proof of lLemma 61 remains the same, adjusting for the general value of E7^[to]. 
Lemma[6f . Let {X, Y) ^ TZ. With probability at least 1 — Se^ , {X, Y) satisfies the following properties: 

(a) g{X,Y) is {e\^'' ,ey^)- almost Boolean. 

(b) E[gi{X,Y)] andE[g2iX,Y)] are el^'^ -close to c~ 1/2. 

(c) n{\9{X.Y)\-lf]<eT. 

We redefine a typical restriction as one satisfying these updated properties. 

Decomposition under random restriction ( § 2.3 ) ILemma Tl and lLemma 81 are not affected by the 
value of c. The value of c comes into play in the main result of this section, ILemma "9l in two ways. 
Firstly, the result is affected by the change in ILemma 51 Secondly, there is a hidden dependence of eo on 
c, namely 

r^>lbe\'\ (11) 
Lemma [9f. Suppose that {X,Y) is a typical restriction. Then either gi is {3ey^ ,19£\^'^) -almost close 

1 /T 1/7 

to c — 1/2 and (72 + 0—1/2 is (4e]^ , 24:6^ ) -almost Boolean, or the same is true with the roles of gi and 
g2 reversed. 

Proof. At the very end of the proof, we need to rule out possibility that for all permutations ■k £ Sn 
satisfying 

g(7r) e {C4 ± 1} ± 21e;/" and g(7r) e {±1} ± e\'\ [(6)] 

the sign of g(7r) is the same. This would imply that g is (SeJ^^, ey^)-almost close to L G {±1}. Applying 
ILemma 7l this in turn would imply that E[(7] is 28eJ^^-close to L. On the other hand, typicality implies 

1/7 

that E[g] is 2e^ -close to 2c — 1. In order to obtain a contradiction, we need to assume that 2c — 1 is 



not 306]^ -close to ±1. This is equivalent to (11) □ 



Random partitions ( |§ 2.4[ ) At the beginning of this section, we defined the concept of a good parti- 
tion, which we now need to update. For a permutation -k £ Sn and a partition X <Z [n], define 

Pi '■= ^aj;r(i), P2 '■— Qi7r(-t)- 

1/7 1/7 

We say that the partition X is good for n if either Pi is 25e^ -close to c — 1/2 and P2 is 25ei -close to 
{— c— 1/2, 3/2 — c}, or the same is true with the roles of Pi and P2 reversed. We say that the permutation 
TT G Sn is good if with probability at least 4/5, a random partition X ~ U(2^"'^) is good for tt (this is the 
same definition as before). With the updated definition. ILemma 101 remains the same. 

As for ILemma TTl apart from slightly updating the statement, there is also a hidden dependence of 
eo upon c, namely 

V = n{eY'). (12) 
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1/7 

Lemma lllf . Suppose that tt £ Sn is a good permutation. Then for some m G [n], |am7r(m)l i^ ^Oe^ - 

1 /T 

close to {2c, 2(1 — c)}, and for i^m, |aj^(j)| < SOej^ . 

1 /7 

Proof. We redefine 'large' elements as those which are SOe^^ -close in magnitude to {2c, 2(1 — c)}. An- 
alyzing what happens when a single Si switches over, we deduce as in the original proof that each Si is 
either small or large. 

For the Berry-Esseen argument, we need a lower bound on . In the original proof, we deduced 

1 /7 

such a bound from the fact that with probability at least 4/5, it holds that \T — 5*0/21 > 1/2 — 50ej^ . 

1/7 

The same argument shows that with probability at least 4/5, it holds that \T — 5*0/21 > rj — bOe^' , and 
therefore 

This implies that 



where the implied constant does not depend upon c. Condition (12) guarantees that (say) -0 < 1/10. 
The intervals Ii , I2 retain their length, while for we get the guarantee 

1/3 1 > 2ri~bQe\'\ 

Recall inequality |(7)[ from which we would like to derive a contradiction: 



The left-hand side is at most (roughly) l/(5?7), while the right-hand side is il(e]^ ). We get a contra- 

— 1/7 I 1 

diction ii 1/iq = 0{€^ ), which is the same condition as (12) 



The rest of the proof goes through without change. □ 



Strong lines ( § 2.5 1 As we mentioned in the introduction, this part is almost completely independent 
of the rest of the proof. All we have to do is redefine a large entry so that it conforms to the specification 
of Lemma [TTTl that is, |ay | is 50ey-close to {2c, 2(1 — c)}. With this small change, all the results in 
this section carry through. 



Culmination of the proof ( § 2.6 ) This section requires a small overhaul. Whereas for c = 1/2, a 
large element was always close to ±1, now all we know is that it is close in magnitude to {2c, 2(1 — c)}. 
Its actual value is therefore close to one of the values {2c, 2(1 — c), —2c, —2(1 — c)}. Defining as before, 
this means that Tij is close to one of the values {0, 1, 2c, 2c — 1}. Of these, one is always outside [0, 1] 
and so cannot occur, and one is a 'medium' value, 

[2c ifc<l/2, 
|2c-l if c> 1/2 ' 

lying inside the interval (0, 1). An additional argument is needed to show that such medium values do 
not actually occur in large quantities on the strong line. 
We first rearrange the formula [(3)[| 

^ _ l^nr.j-l _ 2(n-2)c+l , n ^ ^ 



(n-1)! 2(n-l) ^ 2(n-l) 

Roughly, we have r.y w aij /2 + c. More precisely, we have the following analogue of ILemma 201 
Lemma I20f . Each Tij is 2/n-close to aij/2 + c. 

1 /7 

If aij is large, then Tij is 26e^ -close to {0,1,7}. 
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As before, without loss of generality, we may assume that row 1 is the strong line. Before proving the 
analogue of lLemma 211 we need to show that for any two 'reasonable' large entries on row 1, either both 
are close to {0, 1}, or both are close to 7. Since most entries turn out to be 'reasonable', this implies a 
dichotomy: either most entries are close to {0, 1}, or most are close to 7. Since the row sums to roughly 
cn, the second case cannot occur. 

For J g [n] , let r(j ) be the probability that a random generalized diagonal passing through aij is good. 

1/7 

We say that an entry aij is reasonable if aij is large, r(j) > 4/5 and 5({l},{j}) is {1/5, )-almost 
Boolean. (Recall that .9({l},{j}) is the function /i restricted to permutations in Tij.) 



1/7 

Lemma 23. Assume that 7 is Ibdci -far from {0,1}. Let j,k G [n] be such that aij and aik 



are 



reasona> 



1 /7 

ble. Either both tij and Tik are 26e^ -close to 7, or neither of them are. 



Proof. If TT e Tij then (jfc)7r e Tik- Since 4 • 1/5 < 1, there exists a permutation tt € Tij such that both 
/i(7r) and /i((jfc)7r) are ey''-close to ±1, and both tt and (jfc)7r are good. Let i = 7r^^(fc). Note that 

2{n — 1) 

- aik) ~ (/i((jfc)7r) - Oij) = aij - au- = {rij - Tik). 

1/7 

Since both aik and are small, the left-hand side is l{)2e^' -close to {0,±2}, and therefore Tij — Tik is 
102ey^-close to {0,±n/(ri— 1)}. Assumption |(2)[ implies that 2e\^'^ > 2/n > l/{n— 1), and so — Tik 
is 104ey^-close to {0,±1}. 

1/7 1 /7 

Suppose for a contradiction that Tij is 26e^ -close to 7, and that Tik is 26e^ -close to 6 € {0,1}. 
Then 7 is 156ey'^-close tob + {0, ±1} G {-1, 0, 1, 2}. Since 7 G (0, 1), 7 must be 156ey^-close to {0, 1}, 
contradicting the assumption of the lemma. □ 

Then following lemma says that most of the entries aij are reasonable. 

1 /7 

Lemma 24. The probability that aij is not reasonable for a uniform random j G [n] is 0{€i ). 

Proof. [Corollary 19| shows that the probability that aij is not large is 0(ej^^). 
Corollary 12 shows that E[l — r{j)] < 506^ , and so, by Markov's inequality. 



Pr [l-r(j-)>l/5]<^ = 250e}/^ 
jeln] 1/5 

Notice that fi is {el^\ ey'^)-almost Boolean, since otherwise E[(/i - f)'^] > E[(|/i| - 1)^] > e^^'^e^^'^ 
ei. Therefore, by Markov's inequality. 



5/7 

Pr [g{{l},{j}) is not (1/5, ey'^)-almost Boolean] < 7^ = 56^^^ 
jeln] 1/5 



The lemma follows from a union bound. □ 

We can now prove the analogue of lLemma 211 
Lemma I21f . The number of tu which are 51eJ^^-cZose to 1 is 0{e\^'^)n- close to cn. 

1 /7 

Proof. There are two cases, depending on whether 7 is 1566]^ -close to {0, 1} or not. 

Suppose first that 7 is 156eJ^^-close to {0, 1}. Let A^o be the number of tu which are 182e}/^-close 
to 0, and let A^i be the number of tu which are 182ey^-close to 1. Lemma [SQ shows that 

n 

T := ^ Tu = cn. 

On the other hand. 



(1 - \%2t^'^)Ni <T < 182ey^Aro + (n - Aq) = n - (1 - 182e}/^)Ar, 



0- 
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Substituting in T ~ cn gives: 

Ni < (l + 0(ey^))cn 



No<{l + 0{ey'))(l-c)n, 



|Corollary 19| and Lemma [2011 together show that iVo + A^i = (1 — 0{e-^' ))n, and so 

iVi > (1 - OieY'))n - TVo > (c - 0(ey'))n. 

1/7 

This completes the proof when 7 is Ibde^ -close to {0, 1}. 

Suppose next that 7 is 156eJ^^-far from {0,1}, so that ILemma 231 applies. By ILemma 241 all but 
0(ej^^)n of the entries in the first row are reasonable. ILemma 23l implies that either all of the corre- 
sponding Tii are 26eJ^^-close to 7, or none are. In the latter case, they must be 26eJ^''-close to {0,1}, 
and so an argument similar to the preceding case proves the lemma. It remains to rule out the case that 

1 /7 

all reasonable tu are 26e^ -close to 7. 

1 /7 

Suppose for a contradiction that all the reasonable tu are 26€i -close to 7. Assume first that c < 1/2, 

1/7 

so that 7 — 2c. Since ah but 0{€{' )n of the Tu are reasonable, we have 

T = ^TH> (l-0(e}/'))2cn. 

i 

This contradicts the equation T = cn since ei is small enough (not, here, depending on c). 
Assume now that c > 1/2, so that 7 = 2c — 1. Then we have 

(1 - c)n = n - T = ^(1 - Tii) > (1 - 0(e}/"))2(l - c)n, 

i 

a contradiction. □ 

Note that the assumption ej^^ ~ 0{ri) does not imply that 7 is 156eJ^^-far from {0, 1}. Indeed, if c 
is close to 1/2 then 7 is close to {0, 1}. Hence, it is necessary for us to split the proof of Lemma [?lT] into 
the two cases above. 

The analogue of Corollary 22 now follows, just as before. We state it without any prior assumptions. 
Corollary I22f . Suppose that n > 4 and 

^ < ei < Calf, (13) 



,7/3 



where cq > is an absolute constant. Let T C Sn he a family of permutations with size \T\ — c ■ n\, 
satisfying 

where f — 2xt ~ 1; o,''^d fi is the orthogonal projection of f onto Ui. Then there exists a family G C Sn 

1/7 

which is a union of dn disjoint 1-cosets, where d is 0{€^ )-close to c, satisfying 

\g^F\ < 0{e\")n\. 



Getting rid of the assumptions on ei CoroUarv I22II has a drawback: it needs to assume that ei is 
not too small and not too large. When ei is large enough (depending on 77), the statement holds trivially, 
so we may focus our attention on the case where ei is small. Intuitively, having ei small should work 
in our favor. We shall introduce a few artificial errors to increase ei, and then later on take account of 
them, by introducing an extra error term into our conclusion statement. 
We start by showing how to artificially increase ei. 

Lemma 25. Let J- C Sn, and let v < 1/16. Then there exists a family H d Sn such that 

l-FAHl < V^n! and v < E[{h ~ hi f] < {^/E[{f - /i)2] + 2^/^)^ 

where f = 2xt ~ 1, h — 2x-h ~~ 1 o,nd /i, h\ are the projections of /, h into U\. 
Moreover, if \T\ > n!/2 then TL <Z T , and otherwise Ji Z) T. 
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Proof. By taking complements if necessary, we may assume that > nl/2. Let sgn(7r) denote the 
sign of a permutation tt. Since n > 3, the sign function is orthogonal to Ui (this is because the sign 
representation is not a constituent of the permutation representation), and so 



IE[(/~/i)'] > (/,sgn)2 = (EsgnW/w)'. (14) 



First, assume that at least half of the permutations in T are even. Then the number of these is at least 
n!/4 > ^/vn\. Define Q (and so g) by removing ^/vnl of them. We have 

Esgn(7r)(;(7r) — Esgn(7r)/(7r) — 

Therefore, either sgn(7r)/(7r) > ^/v, or E^ sgn(7r)f/(7r) < ~y/v. In the former case, we take % — F, 
and we are done by the inequality |(14)[ In the latter case, we take % — Q. The inequality |(14)| shows 
that E[(/i — hif'] > V. Moreover, since projections are contracting, we have 

\\h-h^h < ll/-/l||2 + ||(/l-/ll) + (/-/l)||2 < ||/-/l||2 + ||/l-/||2< ||/-/l||2+2V^. 

Similarly, if at least half of the permutations in F are odd, then the number of these is at least n!/4 > 
^Jvn\. Define Q (and so g) by removing ^/vn\ of them. We have 

Esgn(7r).g(7r) = Esgn(7r)/(7r) + 2^/^. 

Therefore, either Ejr sgn(7r)/(7r) < —^/v, or sgn(7r)(;(7r) > ^/v, so we may continue as before. □ 

Using this trick and Corollarv 12211 we get our main theorem in full generality. 
Theorem 2. Let J- C Sn be a family of permutations with size \ J- \ = c - n\, satisfying 

E[(/-/i)'] 

where f — 2xjr — 1^ and fi is the orthogonal projection of f onto Ui. Then there exists a family G d Sn 
which is a union of dn disjoint 1-cosets, such that 

where 77 — min{c, 1 — c}. 

Proof. If n < 4 then the theorem is trivial (by taking the absolute constants implied by the O-terms to 
be sufficiently large), so we may assume that n > 4. If ei satisfies [(2) [[ then the theorem follows directly 
from Corollarv I22[ I Otherwise, there are two cases: ei is too large, and ei is too small. If ei > c^rf then 
the theorem holds, since ''/t? > Cq^ '', so suppose ei < n~''/^. 

Applv lLemma 25) with v — n^^/^ to obtain a family %. The value 62 = E[(/i — /ii)]^ satisfies 

1 f I 2 \^ 9 

< £2 < 



„7/3 - ^ - y„7/6 „7/6 J „7/3- 

Moreover, \!F/\'H\/n\ < n~'^^^ and so C2 E['H] satisfies |c— C2I < Also, 772 := min(c2,l — C2) 

satisfies 1 77 — 772 1 < n~''/^ as well. 

There are two cases: either £2 > co?72, o^' ^^ot. In the first case, > £^''772, and so 772 = 0{n^^/'^). 

Hence r/ = 0{ji~'^/'^) and so the theorem holds, since n~^l'^ lr\ — fi(l). 

The more interesting case is when £2 < co?72- Corollarv I22[ I applies, and we get a family Q C 5„ which 
is the union of dn disjoint 1-cosets, where 

\d - C2I = 0(62^^) and \gAn\ < 0{el^'^)n\. 

Since |c - C2I < n""^/^, e^^'' = 0{n-^/^) and | J'AH] < n-'^/^ ■ n\, the theorem follows. □ 

Remark. When ei > cqT]^ , the error terms 6^^/77 ensures that the theorem holds. Other error terms 
have the same effect, and so other versions of the theorem are possible. For example, instead of e^"^ /rj, 
we could have e[^'^ + {e[^'^ /rj)'^ . 
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4 Almost extremal isoperimetric sets in the transposition graph 



As explained in the introduction, the main reason for developing Fourier-theoretic stability results, such 
as the main theorem of this paper, is for applications in extremal combinatorics. Oftentimes, one must 
struggle to translate the combinatorial information in an extremal problem to the Fourier language, 
but there is one setting in which the translation is almost immediate (yet may demand certain non- 
trivial calculations.) That is the setting of normal Cayley graphs on groups, and characterization of the 
maximum-sized independent sets, or the sets of minimum edge-expansion, in those graphs. See [S] for a 
more complete description of this. In a nutshell, there are good characterizations relating edge-expansion 
in graphs to the eigenvalues and eigenvectors of the graph, namely, the theorems of Alon-Milman [T] and 
Dodziuk A Cayley graph whose generating set is closed under conjugation is known as a normal 
Cayley graph. For any normal Cayley graph on a group F, its eigenspaces are precisely the isotypical 
suhspaces of (the subspaces consisting of functions whose Fourier transform is concentrated on a 
fixed irreducible representation of F). Furthermore, the eigenvalues are given by a formula involving the 
average of the character of the corresponding representation on the generating set of the graph. 

The example of the above phenomenon which we have in mind is the application of the Alon- 
Milman/Dodziuk theorems to the Cayley graph on Sn generated by the transpositions. In other words, 
the graph G with V{G) = Sn, and 

E{G) — {{cr, r} : crr^^ is a transposition} 

— two permutations are joined if they differ by a transposition. For any set A C V{G), we let dA denote 
the edge-boundary of A, i.e. the set of edges between A and its complement. As explained in [5], by 
using Dodziuk/ Alon-Milman, the work of Diaconis and Shashahani [3^ yields the following theorem: 

Theorem 4 (Diaconis and Shashahani). Let A C Sn with \A\ = cnl. Then 

\dA\ > (l ~ c)n\A\, (15) 

with equality if and only if the characteristic vector of A belongs to Ui . 

The characterization of Boolean functions in Ui given in j8j immediately yields the following charac- 
terization of the extremal isoperimetric sets. 

Corollary 26. Let A C Sn, with \A\ = cnl, and \dA\ = (1 — c)n|y^|. Then A is a dictatorship. 

We now want a stability version of this. In [B], Lemma 13, we prove a stability version of Dodziuk/ Alon- 
Milman, which, when combined with the eigenvalue estimates in [3], shows that any set which has 
edge-boundary close to the minimum, must have its characteristic vector very close (in norm) to Ui: 

Theorem 5 (Lemma 13 in [6j). Let A C Sn with \A\ ~ cnl. If 

\dA\ < {l-c + 5o)n\A\, 

then 

E[if - fif] < ^cSo, 
n — 2 

where f is the characteristic vector of A, and fi is its projection on Ui. 

Combining this with Theorem [5] immediately yields the following. 
Theorem 6. Let A C Sn with \A\ = cnl. If 

\dA\ < {l-c + Sn)n\A\ 
then there exists a dictatorship B C 5„ with 

^ - 

We may apply a perturbation argument similar to the one in [6] to prove the following strengthening 
of Theorem [51 
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Theorem 7. There exists rjQ > such that the following holds. Let A C Sn with \A\ = cn\, where 
niin(c, 1 — c) > rjQ. If 

\dA\ < (l-c + ^o)n|^| 
then there exists a dictatorship B C Sn with 

\AAB\ 

= 0{So). 

This is best possible up to an absolute constant factor, as can be seen by taking 

^ = Tn U Ti2 U . . . U Tia U T2,a+1 U T2,a+2 U . . . U T2,a+b, 

where minja/n, 1 — a/n} = and b/a = 8((5o). 

Therefore, a subset of Sn with measure bounded away from and 1, which has edge-boundary close 
to the lower bound [(15)] must be close in structure to a dictatorship. This is a 'genuine' stability result. 
One may contrast it with the 'quasi- stability' result in [6], where we prove that a subset of S„ with size 
Q{{n—l)l) has edge-boundary close to the minimum iff it is close in structure to a union of dictatorships, 
as opposed to a single dictatorship. 



5 Conclusion 

Remarks and open questions 

The most obvious open question in the context of this trilogy is whether it is possible to prove the 
common generalization of the main theorems in all three papers. Is it true that no matter what the 
expectation of / is, if / is Boolean and close to Ut, then it is close to a union of i-cosets? This surely 
must be true, but our techniques fall short of proving it. We also believe the correct dependence between 
the two distances to be linear. We make the following conjecture. 

Conjecture 1. Let A C Sn, and let < G N. Let f denote the characteristic function of A, and let ft 
denote the orthogonal projection of f onto Ut- If 

E[{f - ftf] < eE[f], 

then there exists a family B C Sn which is a union of t-cosets, such that 

\AAB\ < Coe\A\, 

where Cq is an absolute constant. 

Another related question involves understanding the precise extremal isoperimetric sets in the trans- 
position graph on Sn, for all set-sizes. Limor Ben Efraim conjectures that the minimum edge-boundary 
is always achieved by an initial segment of the lexicographical order on Sn- (If c, tt e Sn, we say that 
a < IT in the lexicographic order if a{j) < 7r(j), where j = min{i G [n] : a(i) ^ iT{i)}.) 

It would also be interesting to discover other groups where there is an elegant characterization of 
Boolean functions whose Fourier support is concentrated on certain irreducible representations. 
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